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Abstract 

Yamakami ( Theoret. Comput. Sci., 2011) studies context-free lan¬ 
guages with advice functions. Here, the length of an advice is assumed to 
be the same as that of an input. Let CFL and CFL/n denote the class of 
all context-free languages and that with advice functions, respectively. We 
let CFL(2) denote the class of intersections of two context-free languages. 
An interesting direction of a research is asking how complex CFL(2) is, rel¬ 
ative to CFL. Yamakami raised a problem whether there is a CFL-immune 
set in CFL (2) - CFL/n. The best known so far is that LSPACE - CFL/n 
has a CFL-immune set, where LSPACE denotes the class of languages 
recognized in logarithmic-space. We present an affirmative solution to his 
problem. Two key concepts of our proof are the nested palindrome and 
Yamakami’s swapping lemma. The swapping lemma is applicable to the 
setting where the pumping lemma (Bar-Hillel’s lemma) does not work. 
Our proof is an example showing how useful the swapping lemma is. 

Keywords: context-free language; push-down automaton; advice func¬ 
tion; non-uniform complexity class; immune set. 


1 Introduction 

The regular languages have beautiful closure properties. For example, given 
two regular languages, their intersection is a regular language. Nevertheless, 
in the studies of programming languages, most of important languages are not 
regular. The same holds in the studies of formal models of natural languages. 
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In the case of classes larger than the regular languages, closure properties are 
more difficult than the regular cases. 

In particular, given two context-free languages, their intersection is not nec¬ 
essarily context-free. For a positive integer k , we consider the intersection of k 
context-free languages, and let CFL(fc) denote the class of all such intersections. 
CFL(l) is CFL, the class of all context-free languages. It is known that CFL(fc) 
is a proper subset of CFL(fc + 1). 

How complex is CFL(£; +1), relative to CFL(fc)? An interesting observation 
is given by Flajolet and Steyaert [3] . Let L 3eq denote the set of all strings of the 
form 0”l"2 n where n is a natural number. It is easily seen that L 3eq belongs to 
CFL(2). Flajolet and Steyaert observed that L 3eq is CFL-immune. 

Here, an immune set is a key concept in the classical recursion theory, namely 
in Post’s problem. Later, immune sets relative to complexity classes are studied 
in the complexity theory m- Given a class C of languages, an infinite language 
A is C-immune if no infinite subset of A belongs to C. 


Yamakami’s problem Yamakami [5] raised a problem whether there is a 
CFL-immune set in CFL(2) - CFL /n. 


Here, the n of CFL /n denotes the length of an input. 


Definition 1. (Tadaki et al. [B]) Given a class C of languages, we define C/n 
as follows. Suppose that L is a language over an alphabet E. Suppose that 

T E 

r is another alphabet. We introduce an extended alphabet ^ . It consists 


of all symbols of the form 

£ E n and a = a i 


for x £ E and a £ T. Given two strings 


x = Xi■■ 


denote the string 


Xl 

a i 


X„ 

(In 


■ ■ a„ £ T™ of the same length, we let 
E 

r 


A language L belongs to C/n if and only if there exist a language L' £ C 
and a function h : N —» T* such that for every x £ E*, the length of h{ |x|) is 
the same as that of x and the following holds. 


x £ L 


X 

MM) 


£ L' 


Then, h is called an advice function. h(n) is the advice at length n. □ 

Motives for the problem of Yamakami are the following examples on DCFL 
(the class of languages accepted by deterministic pushdown automata) and REG 
(the regular languages). 

(i) Let L eq denote the set of all strings of the form 0"1" (n > 1). Then L eq 
belongs to DCFL n REG/n and is REG-immune [3]. 

(ii) Let Palj denote the palindromes whose center symbol is a special symbol 
j). Then Paljj belongs to DCFL — REG/n and is REG-immune [8]. 
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The CFL-immune set L 3 eq given in [5] belongs to CFL(2) fl CFL/n. Thus, it 
is natural and interesting to ask whether there is a CFL-immune set in CFL(2) 
- CFL/n. 

The best known so far is that LSPACE - CFL/n has a CFL-immune set 
[5j, where LSPACE denotes the class of languages recognized by deterministic 
Turing machines with a single read-only input tape and a logarithmic-space 
bounded work tape. 

What is the difficult point in the problem of Yamakami? A classical method 
of showing that a language is not context-free is the pumping lemma for CFL 
(Bar-Hillel’s lemma [I]). However, the pumping lemma destroys the advice h(n). 

Our main theorem is an affirmative solution to the problem of Yamakami. 
Two key concepts of our proof are the nested palindrome and Yamakami’s swap¬ 
ping lemma. Our test language is introduced in section [3] The swapping lemma 
is applicable to the setting where the pumping lemma does not work. Our proof 
is an example showing how useful the swapping lemma is. We show our main 
theorem in section [I] 

There is a study of advised context-free languages earlier than Tadaki et 
al. [6]. In the setting of Damm and Holzer [2], the arrangement of the advice 
and the input is serial. In section O we show that the same result as our main 
theorem holds for the advised language class of Damm and Holzer. 


2 Preliminaries 


2.1 Notation 


For two sets A and B , their difference A — B is (x G A : x ^ B}. A C B 
denotes that A is a subset of B; A may equal to B. N = {0,1,2,...} is the set 
of all natural numbers. For a real number x, \x~\ denotes the minimal natural 
number n > x. 

The empty string is denoted by A. An alphabet denotes a finite set of char¬ 
acters. For an alphabet E, the set of all strings is denoted by E*. We let E + 
denote E* — {A}. Given a string w, its length |iy| denotes the total number 
of occurrences of characters. The reverse of w = w\ ■ • -Wn, where n = |u>|, is 
w n - ■ -w\. The reverse of w is denoted by w R . 

REG (CFL, respectively) is the class of all regular (context-free) languages. 
Suppose that C is a given class of languages such as REG or CFL. An advised 
class C/n is defined as in Introduction. 

The class CFL/n is characterized by non-deterministic pushdown automata 
with an advice function (Fig. [[]). It has a one-way read-only input tape and a 
pushdown memory (stack). The input tape has two tracks. An input is given 
on the first track. The advise at the length of the input is given on the second 
track. Then the automaton works as a non-deterministic automaton over the 
Y 

alphabet ^ 

C(2) is the class of all languages that are intersections of two elements of 
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pushdown memory (stack) 
$ is the right-end symbol. 


Zk 


Z2 


z 1 


ct is the left-end symbol. 



Xi 

x 2 


%n—l 

X n 

$ 

a 1 

a 2 


ttn—l 

a n 



A 


9o 


h[n ) = a\ ■ ■ ■ a n in the 2nd track 
is the advice at length n. 


head (reading a symbol 


Xi 

ai 


) 


the current state 


Figure 1: A non-deterministic pushdown automaton with an advice 


C. We have REG(2) = REG. On the other hand, CFL is a proper subset of 
CFL(2). 

An infinite language L is C-immune if L does not have an infinite subset 
that belongs to C. 

2.2 The Swapping Lemma for Context-free Languages 

Suppose that n is a positive integer, S' is a set of strings of length n, i and j are 
natural numbers such that i + j < n, and that u is a string over £ of length j. 
Yamakami |7] defines a subset Si iU of S as follows. 

Si,u = {vi ■ ■ ■ v n G S : v i+ i ■ ■ ■ v i+j = u} 

Thus, the definition of Sj iU depends on j but we omit the suffix j. The swapping 
lemma asserts that if the ratio |Si jM |/|S| is small enough for any i,j,u (with 
certain properties) then there exist two strings x,y £ S such that x' and y' 
belong to L , where strings x' and y' are obtained by swapping the midsections 
of x and y , and such that the midsections are different. 

Lemma 1 (The Swapping Lemma for Context-free languages [7]). Suppose that 
£ has at least two letters, and that L is an infinite context-free language over 
£. Then, there exists a positive integer m, a swapping lemma constant, with 
the following properties. 

Suppose that n > 2 is a natural number, S is a subset of LC\T, n , and jo, k are 
natural numbers such that 2 < jo, 2 jo < k < n, and such that for any positive 
integer i < n — jo and any string u £ £ 7 °, we have: 

|Si, u | < \S\/m(k - j 0 + 1 )(n - j 0 + 1) 

Then, there exist positive integers i,j and two strings x = X 1 X 2 X 3 ,y = J/ 1 J/ 2 J /3 £ 
S with the following properties: i + j < n, jo < j < k, \xf\ = \y\\ = i, 
M = |?/ 2 1 = j, |* 3 1 = 1 2 / 3 1 X 2 ± V 2 , xiy 2 x 3 £ L, and y\x 2 y 3 £ L. 
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3 Our Test Language 

Definition 2. Suppose that we regard each natural number as a letter. Suppose 
E is a finite subset of N and w = w\ ■ ■ ■ w n is a string over E. Given a natural 
number c, let E xc denote {cxm : m £ E}. We let ( w) Xc denote the string over 
E xc given by each component of w multiplied by c. 

(w i • • • w n ) xc = '«!••• u„, where Uj = c x Wj , for each j 


□ 


For example, (1211) x3 = 3633. 

Definition 3. We define our test language L 2 as follows (the suffix 2 is that of 
CFL(2)). 

L 2 := {u;(w ii ) X 3(u;) x i5(u; fi: )x5 : w £ { 1 , 2 } + } 

□ 


4 Main Theorem and Its Proof 


Lemma 2. L 2 belongs to CFL(2). 

Proof. We define languages L 2.1 and L 2j 2 as follows. These are clearly context- 
free languages. 

£ 2,1 := {w(w r )x 3 X :»e{l, 2} + , x £ {5,10,15, 30} + } 

L2,2 ■■= {y(y R )x 5 :y£{ 1 , 2 , 3 , 6 }+} 


Then it holds that L 2 = ^ 2,1 H L 2 , 2 - Hence L 2 belongs to CFL(2). □ 

Lemma 3. L 2 is CFL -immune. 


Proof. L 2 is a subset of L' 2 := {wxy : |iu| = |a;|,2|u;| = \y\,w £ {l,2} + ,a; £ 
{3,6} + and y £ {5,10,15, 30} + }. Let L'f := {a n b n c 2n : n £ N}. A standard 
argument based on Bar-Hillel’s lemma shows that L 2 is CFL-immune, and a 
similar argument shows that L 2 is CFL-immune. Hence, L 2 is CFL-immune. □ 


Theorem 4. (Main theorem) There exists a CFL -immune set in CFL(2) — 
CFL /n. 


Proof. By Lemmas [2] and [3J it is sufficient to show that L 2 does not belong to 
CFL /n. We work with Yamakami’s swapping lemma for context-free languages 
[7]. Consult Example 4.2 of [7] for a basic usage of the swapping lemma. For a 
proof by contradiction, fix a function h and a context-free language L such that 
Vn \h(n)\ = n , and such that for any £ £ {1, 2, 3, 6, 5,10,15, 30}*, the following 
holds. 


f £ L 2 £4 


£ 

m) 


£ L 
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Let to be a swapping lemma constant for the context-free language L. Let 
n be a multiple of 16 with the following property. 

2"/ 4 > (2mn 2 ) 4 (1) 

We define a subset S of L as follows. 

G L : £ G {1,2,3,6,5,10,15,30}”| 

Since the string w(w r ) X 3 (w) x io(w r ) X 5 is uniquely determined by w G 
{1,2} + , the following holds. 

\S\ = 2”/ 4 (2) 

Let k and jo be the followings. Here, the base of the logarithm is 2. 

k := n/A (3) 

jo := 2(|"log(mn 2 )] + 1) (4) 

By m and (131), k = n/A > 4(log(mn 2 ) + 1). Since n/A is a multiple of 4, 
k > 4({log(TO?r 2 )] + 1). By 0, we get the following. 

k > 2j 0 (5) 

Given a natural number i and a string u over the alphabet of L such that 
i + jo < n and |it| = jo, we define as follows. 

Si, u = {«i • • • v n G S : v i+1 ■ ■ ■ v i+jo = u} 

When w(w R ) x o(w) x i$(w R )xo is in Si yU , some bits of w are bound by u. We 
are going to estimate the number of bits bound by u. The minimal number is 
achieved when u spans the border of any two blocks with the center of u at the 
border. Thus, at least fjo/2] bits of w are bound by u. Therefore, we have the 
following. 

|Si,„| < 2”/ 4 ^°/ 2 (6) 

Now, we have the following. 

\S i>u \ < \S\/kmn (7) 

This is shown as follows. 2 J °/ 2 > 2 mn 2 [by <(4])] = 8 kmn [by Q] > kmn. 

Thus, 2- 70 / 2 > kmn. By © and {6]), we have shown ([7| . 

By ([5} and 0 , we can apply the swapping lemma for context-free languages 
|7j Lemma 4.1] to the present setting. 

By the swapping lemma, there exist natural numbers i, j and strings x,y £ S 
with the following properties. 

• 1 < i < n — j and jo < j < k{= n/A ) 
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• x,y are of the form x = X\X 2 Xs ,y = yiy 2 y 3 , where each xi and yz are 
strings, and it holds that \xi\ = \yi\ = i, \x 2 \ = I 2/2 1 = j, l^l = |?/ 31 , 
X 2 ± V 2 , xyy 2 x 3 G L and y\x 2 y 3 G L. 


Let £, 77 , £7 and rje (t = 1,2, 3) be the projections of x, y, xc and yi to the first 


track, respectively. For example, x = 


£ 

h{n) 


,y = 


V 

h(n) 


Since x belongs to L and the second component is h(ri), it holds that £ £ L 2 . 
Therefore £ is of the form w(w R ) x3 (w) x i 5 (w R ) x3 for some string w of length 
n/ 4. Here, it holds that \^ 2 \ = j < k = n/ 4 = |w|. Therefore, £2 is included by 
either one of w, (w R ) x 3 , (w) x i 5 and (w R ) x3 or included by consecutive two of 
them. The same holds for 772 - 

Fig. [2] demonstrates the case of i+j < n/A. Here, £2 is included by w. Fig. [3] 
demonstrates the case of i < n/A < i+j. Here, £2 is included by w(w R ) X 3 - The 
other cases are similar. 

The swapping of X 2 and y 2 does not affect the second components h{n) of x 
and y (Fig. [4j). Thus both £^£3 and iir] 2 £, 3 belong to L 2 , and £2 ^ 772 . Hence, 
we get a contradiction. 


£1 

£2 

£3 

i i + j 

w 

(w r ) x 3 (w) x i 5 (w r ) x5 


n/A 


Figure 2: The case of i + j < n/A 


£1 

£2 

£3 


i+j 

w 

(w R ) x 3 

(w) x 15 (w R ) x5 


n/A 

77-/2 


Figure 3: The case of i < n/A < i+ j 
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swap 


6 

€2 

6 

0< 

A ■ ♦ 

vOO> 

Xi 

X2 

^3 

6 

m 

£3 

0< 

A ■ ♦ 

vOO> 

X\ 

2/2 

X3 


Vi 

V2 


0< 

A ■ ♦ 

vOO> 

2/i 

2/2 

2/3 

Vi 

£2 

V3 

0< 

A ■ ♦ 

vOO> 

2/1 

X2 

2/3 


h(n) =Q< A«4 VOO> 


Figure 4: The swapping does not affect the second track 


Thus, we have shown that L 2 does not belong to CFL /n. Hence, we have 
shown the theorem. □ 


5 Serial Advices 

5.1 A Variation of the Main Theorem 

Damm and Holzer [2] investigated advised language classes earlier than Tadaki 
et al. [6]. In the definition of Damm and Holzer, the arrangement of the advice 
and the input is serial, while in that of Tadaki et al., it is parallel. In this section, 
we show that the same result as our main theorem holds for the advised language 
class in the sense of Damm and Holzer. 

Definition 4. (Damm and Holzer [2]) Given a class C of languages, the advised 
language class C/n in the sense of Damm and Holzer is defined as follows. 
Suppose that E and To are alphabets. Suppose that L is a language over E. A 
language L belongs to C/n (in the sense of Damm and Holzer) if and only if there 
exist a language L" G C and a function g : N —>• Tq such that Vn \g(n)\ = n, and 
such that the following holds. 

x G L <t=> g(|x|) x G L" 

Here, g(|:c|) x is the concatenation of g(|a;|) and x. Then, g is called an 
advice function. g(n ) is the advice at length n. □ 

In the remainder of the paper, C/n denotes the advised class in the sense of 
Tadaki, Yamakami and Lin [6), that is, the parallel one defined in Introduction. 
On the other hand, (C/n) ser i a i denotes the advised class defined in this section. 
The following is a variation of our main theorem. 


















Theorem 5. There exists a CFL -immune set in CFL(2) — (CFL/n) ser iai- 

Proof. Let L 2 be the language defined in Definition [3] In the same way as 
the proof of Theorem [2 it is sufficient to show that L 2 does not belong to 
(CFL/n) ser i a i. Fix an advice function g and a context-free language L such that 
for any x £ {1,2,3, 6, 5,10,15, 30}*, the following holds. 

x £ L 2 -H- < 7 (|a:|) x £ L 

Choose natural numbers to, n , fc, jo in the exactly same way as the proof of 
Theorem |U We define a subset S of L as follows. 

S := {g(n) iGl:iG {1,2, 3,6, 5,10,15, 30}”} 

Thus, each member of S has length 2 n. We have |Si jU | < \S\/km(2n). By 
the swapping lemma, there exist natural numbers i,j and strings x and y of 
the following properties. g(n) x and g(n) y are in S and they are of the form 
g{n ) x = XiX 2 Xv, and g(n) y = 2/12/22/3 , where each X( and ye are strings, and it 
holds that |xi| = I2/1I, \x 2 \ = \y 2 \ < n/4, |x 3 | = \y 3 \, x 2 ± y 2 , xiy 2 x 3 G L and 
2 / 1 ^ 22/3 G L . 

Since x 2 and y 2 are not identical, they are not substrings of g{n). Hence, in 
the same way as the proof of Theorem [4j we get a contradiction. 

Thus, we have shown that L 2 does not belong to (CFL/n) ser iai- Hence, we 
have shown the theorem. □ 

5.2 Remarks and Problems 

In this subsection, we discuss separations of parallel advice classes and serial 
advice classes. 

In general, the two concepts of advised classes do not coincide. Recall that 
REG denotes the class of all regular languages. 

Example 1. REG/n is not a subset of (REG/n) ser iai- A proof is as follows. 
Damm and Holzer show that the language L eq = |0”1" : n G N} does not 
belong to (REG/n) ser iai [2 Propositions 1 and 7]. On the other hand, let h(n) 
be o n / 2 l"/ 2 if n is even; 2” otherwise. Then, by means of an advice function h, 
L eq is shown to be in REG/n. □ 

Example 2. (REG/n) se riai is a subset of REG/n. A proof is as follows. Suppose 
L is an element of (REG/n) ser iai- Let L" and g be a regular language and 
an advice function satisfying the requirements in Definition [4j Let M be a 
deterministic finite automaton that accepts L". Given a natural number n, let 
< 7 ( n ) be the state when M has read g(n). Provided that </(„) is given, without 
knowing what <?(n) is, we can simulate the moves of Ad after reading g(n). Then, 
we define h{n) as to be <?(„) 0" -1 . The Os in the tail are just for adjusting the 
length of h(n). Let P be the union of {0} and the set of the states of M. Now, it 
is easy to define a regular language L' satisfying the requirements in Definition}!] 
with respect to T and h. Therefore, L belongs to REG/n. □ 
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A direct proof of Example [T] is given by means of prefix-free Kolmogorov 
complexity. 

Definition 5. [5 

• A string u = u\- ■ ■ u m is a prefix of a string v = v± ■ ■ ■ v n if m < n and 
for each i < m it holds that it,; = v. t . A set S of strings is prefix-free 
if for each u,v G S such that u ^ v, u is not a prefix of v. A partial 
function M : {0,1}* —> {0,1} is called a prefix-free machine if M is a 
partial recursive function and the domain of M is a prefix-free set. 

• For a prefix-free machine M, its descriptive complexity Km ■ {0,1}* —>• NU 
{oo} is defined as follows. Suppose x G {0,1}*. If there exists a € {0,1}* 
such that M(a) = x then Km(x) is the length of a shortest such er. If 
there is no such a then Km(x) is oo. 

• A prefix-free machine R is an optimal prefix-free machine if for each prefix- 
free machine M, there is a constant d (depending on M) such that for each 
x G {0,1}*, K r (x) < K m (x ) + d. 

□ 

It is known that there exists an optimal prefix-free machine [5j Proposi¬ 
tion 2.2.7]. We fix such a machine R, and let K denote K R . For an infinite 
binary string Z : N — > {0,1} and a natural number n, we let Z \ n denote 
Z(Q)Z{1) ■ ■ ■ Z(n — 1). It is known that there exists a binary string Z of the 
following property [5] section 3.2]. 

36 G N Vn G N [K(Z \n)>n-b} (8) 

Example [3] is a refinement of the proof by Damm and Holzer [5] that L eq = 
{0"l n : n G N} £ (REG/n) ser iai. 

Example 3. A direct proof that REG/n is not a subset of (REG/n) ser iai- Let 
Z be an infinite binary string satisfying ©. Let L:={2 fn:nGN}. 

By means of an advice function h(n) = Z \ n, L is shown to be in REG/n. 

We are going to show that L does not belong to (REG/n) ser iai- Assume 
that L belongs to it. Suppose that L" and g are a regular language and an 
advice function satisfying the requirements in Definition [I] Suppose that M is 
a deterministic finite automaton that accepts L”. Let Q be its set of states. For 
each n, let <?(„) be the state when M has read g{n). 

We define a deterministic Turing machine N as follows. An input is an 
ordered pair (q,n) G Q x N. For each y G {0,1}", simulate the moves of M as 
follows. Set the state (of the virtual M) being q. Let M read y. If M accepts 
y 1 return y and halt. 

If the for-loop finishes without any output, then N does not halt. 

By the definition of L" and g, for the input (?(n),^), N outputs Z \ n. In 
addition, by a certain appropriate coding, we may assume that the domain of 
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TV is a prefix free set. For example, code an ordered pair (u\ ■ ■ ■ u m , V\ ■ • • vg) by 
a string u\U\ ■ • • u m u m 01vivi ■ ■ ■ vgVgOl. 

Therefore, Kpj(Z \ n ) is in the order of the length of (<?(„),n). Thus, it 
is 0(log 2 (n)). Hence, by the definition of an optimal machine, K(Z \ n) < 
Kn(Z \ n) + 0(1) = 0(log 2 n). This contradicts to the assumption of ©■ □ 

To our knowledge, we do not know whether the following hold. 

1. CFL/n C (CFL/n) serial ? 

2. (CFL/n) 

serial CFL/n ? 
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